Tissue typology annotation in whole-slide histological images is a complex and tedious, yet necessary, task for the development of computational pathology models. We propose to address this problem by applying open-set recognition techniques to the task of jointly classifying tissue that belongs to a set of annotated, clinically relevant classes, while rejecting at test time open-set samples, i.e. images that belong to categories not present in the training set. To this end, we introduce a new approach to open-set histopathological image recognition based on training a model to accurately identify image categories while simultaneously predicting which data-augmentation transformation has been applied. At test time, we measure the model's confidence in predicting this transformation, which we expect to be lower for images in the open set. We carry out comprehensive experiments in the context of colorectal cancer assessment from histological images, which provide evidence of the strengths of our approach for automatically identifying samples from unknown categories. Code is released at https://github.com/agaldran/t3po
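A minimal PyTorch sketch of the idea described above, assuming a two-head network (tissue class and applied transform) on a ResNet-18 backbone; the class names, backbone choice, and scoring function are illustrative assumptions, and the authors' actual implementation is in the linked repository.

```python
# Hypothetical sketch of the two-head idea (tissue class + applied transform);
# the authors' actual implementation is in the linked repository.
import torch
import torch.nn as nn
import torchvision.models as models

class TwoHeadNet(nn.Module):
    def __init__(self, n_tissue_classes: int, n_transforms: int):
        super().__init__()
        backbone = models.resnet18(weights=None)
        feat_dim = backbone.fc.in_features
        backbone.fc = nn.Identity()          # reuse the trunk, drop its classifier
        self.backbone = backbone
        self.class_head = nn.Linear(feat_dim, n_tissue_classes)
        self.transform_head = nn.Linear(feat_dim, n_transforms)

    def forward(self, x):
        feats = self.backbone(x)
        return self.class_head(feats), self.transform_head(feats)

def open_set_score(model, image_batch):
    """Confidence of the transform head; expected to be lower for open-set images."""
    model.eval()
    with torch.no_grad():
        _, transform_logits = model(image_batch)
        probs = torch.softmax(transform_logits, dim=1)
        return probs.max(dim=1).values       # low value -> likely unknown category
```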
This paper compares well-established convolutional neural networks (CNNs) to the recently introduced Vision Transformers for the task of diabetic foot ulcer classification, in the context of the DFUC 2021 Grand Challenge, in which this work attained first position. Comprehensive experiments show that modern CNNs are still capable of outperforming Transformers in a low-data regime, likely owing to their ability to better exploit spatial correlations. In addition, we empirically demonstrate that the recent Sharpness-Aware Minimization (SAM) optimization algorithm considerably improves the generalization capability of both kinds of models. Our results show that, for this task, the combination of CNNs and the SAM optimization procedure outperforms any other of the considered approaches.
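As a rough illustration of the SAM procedure mentioned above, the following is a simplified sketch of a single SAM update step (first ascend to a nearby weight perturbation, then descend with the base optimizer); the function name, the `rho` default, and the overall structure are assumptions modelled on common open-source SAM implementations, not the authors' code.

```python
# Simplified sketch of a Sharpness-Aware Minimization (SAM) update step;
# hyperparameter names (rho) and structure follow common open implementations.
import torch

def sam_step(model, loss_fn, optimizer, inputs, targets, rho=0.05):
    # 1) first forward/backward to get the ascent direction
    loss = loss_fn(model(inputs), targets)
    loss.backward()
    grads = [p.grad.detach().clone() for p in model.parameters() if p.grad is not None]
    grad_norm = torch.norm(torch.stack([g.norm() for g in grads])) + 1e-12

    # 2) climb to the nearby "sharp" point: w + rho * g / ||g||
    eps = []
    with torch.no_grad():
        for p in model.parameters():
            if p.grad is None:
                eps.append(None)
                continue
            e = rho * p.grad / grad_norm
            p.add_(e)
            eps.append(e)
    optimizer.zero_grad()

    # 3) gradient at the perturbed weights, then undo the perturbation
    loss_fn(model(inputs), targets).backward()
    with torch.no_grad():
        for p, e in zip(model.parameters(), eps):
            if e is not None:
                p.sub_(e)

    # 4) descend with the base optimizer (e.g. SGD or Adam)
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()
```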
Adequate classification of proximal femur fractures from X-ray images is crucial for the treatment choice and for patients' clinical outcomes. We rely on the commonly used AO system, which describes a hierarchical knowledge tree that classifies the images into types and subtypes according to the fracture's location and complexity. In this paper, we propose a method for the automatic classification of proximal femur fractures into 3 and 7 AO classes based on a convolutional neural network (CNN). As is well known, CNNs need large and representative datasets with reliable labels, which are hard to collect for the application at hand. We therefore design a curriculum learning (CL) approach that improves over the performance of a basic CNN in this setting. Our novel formulation unites three curriculum strategies: individually weighting training samples, reordering the training set, and sampling subsets of data. At the core of these strategies is a scoring function that ranks the training samples. We define two novel scoring functions: one derived from domain-specific prior knowledge and an original self-paced uncertainty score. We perform experiments on a clinical dataset of proximal femur radiographs. The curriculum improves proximal femur fracture classification up to the performance of experienced trauma surgeons. The best curriculum method reorders the training set based on prior knowledge, yielding a 15% improvement in classification. Using the publicly available MNIST dataset, we further discuss and demonstrate the benefits of our unified CL formulation in three controlled and challenging digit-recognition scenarios: with limited amounts of data, under class imbalance, and in the presence of label noise. The code for our work is available at: https://github.com/ameliajimenez/curriculum-learning-prior-uncertainty
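A minimal sketch of two of the three curriculum strategies (reordering the training set and sampling growing subsets) driven by a generic scoring function; the schedule, batch size, and helper names are hypothetical, and the paper's actual domain-prior and self-paced uncertainty scores are available in the linked repository.

```python
# Minimal sketch of curriculum reordering/subsetting by a scoring function
# (easy samples first). The scores themselves are a placeholder here.
import numpy as np
from torch.utils.data import Subset, DataLoader

def curriculum_reorder(dataset, scores):
    """Return a DataLoader that visits samples from easiest (highest score) to hardest."""
    order = np.argsort(-np.asarray(scores))       # descending score = easy -> hard
    return DataLoader(Subset(dataset, order.tolist()), batch_size=32, shuffle=False)

def curriculum_subset(dataset, scores, epoch, total_epochs, min_frac=0.3):
    """Data-sampling variant: gradually grow the training subset as epochs progress."""
    frac = min(1.0, min_frac + (1.0 - min_frac) * epoch / max(1, total_epochs - 1))
    order = np.argsort(-np.asarray(scores))
    keep = order[: int(len(order) * frac)]
    return DataLoader(Subset(dataset, keep.tolist()), batch_size=32, shuffle=True)
```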
We study the impact of different loss functions on lesion segmentation from medical images. Although the cross-entropy (CE) loss is the most popular choice when dealing with natural images, for biomedical image segmentation the soft Dice loss is often preferred due to its ability to handle imbalanced scenarios. On the other hand, the combination of both functions has also been applied successfully to this kind of task. A much less studied question is the generalization ability of all these losses in the presence of out-of-distribution (OoD) data, i.e. samples appearing at test time that are drawn from a distribution different from that of the training images. In our case, we train our models on images that always contain lesions, but at test time we also have lesion-free samples. Through comprehensive experiments on lesion segmentation from endoscopic images and from diabetic-foot ulcer images, we analyze the impact of minimizing different loss functions on in-distribution performance as well as on the ability to generalize to OoD data. Our findings are surprising: CE-Dice loss combinations that excel at segmenting in-distribution images perform poorly when dealing with OoD data, which leads us to recommend the CE loss for this kind of problem, owing to its robustness and ability to generalize to OoD samples. Code associated with our experiments can be found at https://github.com/agaldran/lesion_losses_ood
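For reference, a generic sketch of the soft Dice loss and a CE-Dice combination for binary lesion segmentation; the smoothing constant, mixing weight `alpha`, and reduction choices are common defaults and not necessarily those used in the paper's experiments.

```python
# Generic soft Dice loss and a CE + Dice combination for binary lesion segmentation;
# smoothing constant and mixing weight are common defaults, not the paper's exact ones.
import torch
import torch.nn.functional as F

def soft_dice_loss(logits, targets, eps=1e-6):
    probs = torch.sigmoid(logits)
    dims = (1, 2, 3)                                  # sum over channel and spatial dims
    intersection = (probs * targets).sum(dims)
    denominator = probs.sum(dims) + targets.sum(dims)
    dice = (2.0 * intersection + eps) / (denominator + eps)
    return 1.0 - dice.mean()

def ce_dice_loss(logits, targets, alpha=0.5):
    ce = F.binary_cross_entropy_with_logits(logits, targets)
    return alpha * ce + (1.0 - alpha) * soft_dice_loss(logits, targets)
```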
Diabetic retinopathy (DR) is one of the leading causes of blindness in the working-age population of developed countries, caused by a side effect of diabetes that reduces the blood supply to the retina. Deep neural networks have been widely used in automated systems for DR classification from fundus images. However, these models require a large number of annotated images, and in the medical domain expert annotations are expensive, tedious, and time-consuming; as a result, only a limited number of annotated images is available. This paper proposes a semi-supervised method that leverages unlabeled images together with labeled ones to train a model that detects diabetic retinopathy. The proposed method uses unsupervised pre-training via self-supervised learning, followed by supervised fine-tuning on a small set of labeled images together with knowledge distillation to improve performance on the classification task. The method was evaluated on the EyePACS test set and the Messidor-2 dataset, achieving 0.94 and 0.89 AUC respectively, while using only 2% of the labeled images in the EyePACS training set.
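A minimal sketch of the knowledge-distillation term typically used in such a fine-tuning stage, combining a temperature-softened KL divergence against the teacher with the usual cross-entropy on labels; the temperature `T` and weight `alpha` are illustrative, not the paper's settings.

```python
# Sketch of a distillation objective for supervised fine-tuning:
# KL term on temperature-softened teacher/student outputs plus CE on labels.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    soft_targets = F.softmax(teacher_logits / T, dim=1)
    log_student = F.log_softmax(student_logits / T, dim=1)
    kd = F.kl_div(log_student, soft_targets, reduction="batchmean") * (T * T)
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce
```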
We present a Machine Learning (ML) case study to illustrate the challenges of clinical translation for a real-time AI-empowered echocardiography system using data from ICU patients in LMICs. The case study covers data preparation, curation and labelling of 2D ultrasound videos from 31 ICU patients in LMICs, as well as model selection, validation and deployment of three thinner neural networks to classify the apical four-chamber view (4CV). Results of the ML heuristics showed promising implementation, validation and application of thinner networks for classifying 4CV with limited datasets. We conclude this work by noting the need for (a) datasets with greater diversity of demographics and diseases, and (b) further investigation of thinner models that can run on low-cost hardware so that they can be clinically translated in the ICU in LMICs. The code and other resources to reproduce this work are available at https://github.com/vital-ultrasound/ai-assisted-echocardiography-for-low-resource-countries.
This project explores the feasibility of remote patient monitoring based on the analysis of 3D movements captured with smartwatches. We base our analysis on the Kinematic Theory of Rapid Human Movement. We have validated our research in a real case scenario for stroke rehabilitation at the Guttmann Institute (a neurorehabilitation hospital), showing promising results. Our work could have a great impact on remote healthcare applications, improving medical efficiency and reducing healthcare costs. Future steps include further clinical validation, developing multi-modal analysis architectures (analysing data from sensors, images, audio, etc.), and exploring the application of our technology to monitoring other neurodegenerative diseases.
Assessing physical condition in rehabilitation scenarios is a challenging problem, since it involves Human Activity Recognition (HAR) and kinematic analysis methods. In addition, the difficulties increase in unconstrained rehabilitation scenarios, which are much closer to real use cases. In particular, our aim is to design an upper-limb assessment pipeline for stroke patients using smartwatches. We focus on the HAR task, as it is the first part of the assessment pipeline. Our main target is to automatically detect and recognize four key movements inspired by the Fugl-Meyer assessment scale, performed in both constrained and unconstrained scenarios. In addition to the application protocol and dataset, we propose two baseline methods for detection and classification. We believe that the proposed framework, dataset and baseline results will help to foster this research field.
This paper develops a clustering method that takes advantage of the sturdiness of model-based clustering, while attempting to mitigate some of its pitfalls. First, we note that standard model-based clustering likely leads to the same number of clusters per margin, which seems a rather artificial assumption for a variety of datasets. We tackle this issue by specifying a finite mixture model per margin that allows each margin to have a different number of clusters, and then cluster the multivariate data using a strategy-game-inspired algorithm which we call Reign-and-Conquer. Second, since the proposed clustering approach only specifies a model for the margins -- but leaves the joint unspecified -- it has the advantage of being partially parallelizable; hence, the proposed approach is computationally appealing as well as more tractable for moderate to high dimensions than a 'full' (joint) model-based clustering approach. A battery of numerical experiments on artificial data indicates an overall good performance of the proposed methods in a variety of scenarios, and real datasets are used to showcase their application in practice.
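As a rough illustration of the per-margin modelling step (not of the Reign-and-Conquer combination itself), the sketch below fits a univariate Gaussian mixture to each margin and lets BIC choose a possibly different number of clusters per margin; the mixture family, model-selection criterion, and function names are assumptions for illustration only.

```python
# Illustration of the per-margin step: fit a univariate Gaussian mixture to each margin
# and let BIC pick a possibly different number of clusters per margin. The subsequent
# combination of margin-wise labels is not sketched here.
import numpy as np
from sklearn.mixture import GaussianMixture

def fit_margins(X, max_components=6, seed=0):
    labels, n_clusters = [], []
    for j in range(X.shape[1]):                       # margins can be fitted in parallel
        col = X[:, j].reshape(-1, 1)
        models = [GaussianMixture(k, random_state=seed).fit(col)
                  for k in range(1, max_components + 1)]
        best = min(models, key=lambda m: m.bic(col))
        labels.append(best.predict(col))
        n_clusters.append(best.n_components)
    return np.column_stack(labels), n_clusters
```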
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
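A short usage sketch loading one of the released BLOOM checkpoints from the Hugging Face Hub; the smallest variant (bigscience/bloom-560m) is used here for illustration, since loading the full 176B model requires multi-GPU or offloaded setups well beyond what this snippet shows.

```python
# Usage sketch: generating text with one of the released BLOOM checkpoints.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-560m")
model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")

inputs = tokenizer("BLOOM is a 176B-parameter open-access language model that",
                   return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=30)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```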